PrivacyPrivacy
Skip to main content
Intended for healthcare professionals
Open access
Research article
First published online April 15, 2024

The geographies and complexities of online networks in the off-street sex market

Abstract

Exploitation and human trafficking in sex markets tend to include both online and offline spaces. Understanding the scale, complexity and geography of networks is important in policing human trafficking and online escort adverts are often used to identify organised crime in this context. This article aims to make a methodological contribution to how data relating to online networks in the sex market can be collected and analysed. Through the application of web scraping, social network analysis and principal component analysis, the digital traces of 15,016 online networks operating on an adult services website were analysed in relation to their complexity and geographical patterning. The findings suggest that structural and geographical characteristics are useful for understanding the heterogeneity of online networks. Analysing networks, as opposed to assessing escort adverts, offers a more robust approach to understanding the sex market, which is more sensitive to the continuum of experiences encapsulated therein.

Introduction

The sex market encompasses heterogeneous populations and different actors, including organised crime groups (OCGs), who may facilitate exploitation (Skidmore et al., 2018). This may include offences captured in the broader definitions of modern slavery and human trafficking, which have often been criticised for being too nebulous and lacking precision (O’Connell Davidson, 2015). For the purposes of this article, the term ‘exploitation’ is preferred, since exploitation within the sex market is multifaceted and not always easy to classify as ‘human trafficking’ or ‘modern slavery’ (Kjellgren, 2022). Nevertheless, the criminal offence focussed on in this research is human trafficking (United Nations, 2000) and what is of interest is how it is embedded in online and offline spaces associated with the sex market.
Networks in the sex market are formed for a variety of reasons and may include sex workers working together, escort agencies, brothels, or criminal networks, in which the constellation might involve victims and offenders alike (Crocker et al., 2017; Hester et al., 2019; Sanders et al., 2018). As the sex market is increasingly mediated by technologies, including advertising sexual services on adult service websites (ASWs), networks inevitably leave digital traces online (Europol, 2020; Walby et al., 2016). Since there is a variety of constellations of actors operating in the off-street sex market, a certain degree of plurality in online networks would also be expected. Indeed, the structure of an online network is likely to be affected by the offline organisation of the network, or actor, if an independent sex worker posts the adverts (Kjellgren, 2023). In this context, there may be a latent dimension– pertaining to the complexity of online networks in the off-street sex market. Their structure, as observed from digital traces, is likely to approximate (albeit imperfectly) the offline structure of networks, in terms of their scale and spatio-temporal patterning. Since the off-street sex market is premised upon online advertising, data from escort adverts can illuminate the geographical markets that are targeted by sex workers and networks. Given how the off-street sex market, as opposed to the street-based market, is mediated by online technologies (Sanders et al., 2018), the contemporary policing of human trafficking and exploitation is likely to include intelligence and investigation pertaining to digital traces (Kjellgren, 2023; Skidmore et al., 2018); yet there is little research that has explored how to best capture the complexity and diversity of online networks, which is of significant importance for law enforcement to more accurately interpret digital traces. This research is not only important in relation to improving how open-source intelligence (OSINT) can be utilised, but also in highlighting that the sex market is underpinned by networks, and that we cannot infer the presence of exploitation, organised crime or criminal networks from this source of data alone.
This article is focussed on exploitation in the UK’s sex markets and the networks who may be facilitating it. More specifically, it is focussed upon the exploitation of adult women given the high rates of reported victimisation of this demographic (Home Office, 2023), and links between exploitation and ASWs (Sanders and Keighley, 2023). Drawing upon feminist and critical theoretical frameworks, which locate exploitation as part of a continuum of experiences and recognise a diversity of experiences within the sex market (e.g. Malloch and Rigby, 2016; O’Connell Davidson, 2015; Sanders et al., 2018), this article seeks to advance our understanding of the online dimension of the off-street sex market. It aims to evaluate the feasibility of using scraped data to understand the geographical patterning and diversity of networks and will address the following research questions:
Research Question 1: How can we understand the complexity of online networks operating in the UK’s off-street sex market?
Research Question 2: To what extent do networks differ in terms of their complexity?
This article will proceed to provide an overview of contemporary issues relating to the policing of criminal networks and OCGs in the sex market. It will subsequently describe the data and methods used in this research. Principal component analysis (PCA) will be used as a means of dimensionality reduction, to explore and describe the variability and heterogeneity of online networks. It will then discuss the geographical distribution of networks across the United Kingdom and highlight key differences between the 50 most complex networks and the other networks in the sample. To complement this analysis, it also highlights how four networks that span the continuum of complexity differ in their geographical patterning. The article concludes with a discussion of the significance of the identified patterns and the feasibility of using scraped data in assessing networks in the sex market.

Policing online sex markets in the United Kingdom

Human trafficking has previously been highlighted as one of the most complicated crimes to investigate (Pajón and Walsh, 2018), the complexity of the crime being attributable to several features. Human trafficking is a processual crime (Malloch and Rigby, 2016), consisting of three stages: recruitment, transportation and exploitation. It is also an essentially relational crime; the process of exploitation is predicated upon social relations, and patterns of victimisation and opportunities for offending are embedded within social networks (Verhoeven et al., 2013). Others have highlighted how human trafficking is fundamentally spatio-temporal (Cockbain et al., 2022), often involving the movement of victims across vast distances, and exploitation can occur over prolonged periods (Cockbain and Brayley-Morris, 2018). Policing the sex market is a challenging task, since it involves policing a market that comprised both autonomous, independent sex workers, and also individuals who are exploited by criminal networks and OCGs (National Police Chiefs’ Council (NPCC), 2019; Sanders et al., 2020). In addition, there are indications that exploitation within the sex markets of the United Kingdom is increasingly mediated by online technologies, which both change how OCGs within these spaces operate, and the role and utility of OSINT in policing contexts (Crocker et al., 2017). According to recent research, ASWs are attractive to OCGs and key online spaces for enabling human trafficking and, consequently, contain large volumes of potential intelligence (Sanders and Keighley, 2023).
Historically in the United Kingdom, the policing of sex markets has largely been focussed on more public aspects, including street-based sex work, with an emphasis on nuisance, public order, morality and third-party control (Scoular et al., 2019). With the sex market increasingly Internet-mediated (Sanders et al., 2018), traditional policing approaches are inadequate to respond to contemporary harms and exploitation (Scoular et al., 2019). Notwithstanding the digital element, contemporary policing has more recently been influenced by broader changes in the understanding of sex work as a distinct form of labour (Sanders et al., 2020), as opposed to more radical feminist views equating sex work with sexual violence or exploitation (e.g. the All-Party Parliamentary Group on Prostitution and the Global Sex Trade, 2018).
Recent guidance from the National Police Chiefs’ Council (NPCC, 2019) proposes an approach to policing that focusses on harm reduction and recognises the nuance of experiences encompassed within the sex market. This, for instance, involves a focus on responding to where the threats of harm, violence and exploitation are greatest, and the importance of building trust with the wider sex worker community to better address their needs (NPCC, 2019). Nevertheless, the United Kingdom encompasses a large number of distinct police forces and divisions, and there is substantial variability in how sex markets are policed locally (Sanders et al., 2020). This also includes how intelligence is gathered and collated; there is often limited intelligence on human trafficking, though intelligence-led policing is nevertheless important to disrupt and control this crime more effectively (Atkinson and Hamilton-Smith, 2020). Of specific relevance to this article is the role of OSINT, which in this context primarily consists of online escort adverts posted on ASWs. How such intelligence is generated and utilised similarly also varies between forces (Sanders et al., 2018; Scoular et al., 2019).
Previous research has provided suggestions on how to generate OSINT in this context. The social scientific literature has largely been focussed on the use of indicators to signal the presence of exploitation within the sex market – such as shared phone numbers or the advertising of ‘extreme services’ – through manual screening of online adverts and risk matrices (e.g. L’Hoiry et al., 2021; Skidmore et al., 2018). However, manually screening adverts is ineffective and will never successfully grasp the wider patterns and structures within the online sex market. Others have suggested big data-oriented approaches involving web scraping and machine learning to identify, predict or evaluate the presence of exploitation (e.g. Dubrawski et al., 2015; Giommoni and Ikwu, 2021; Ibanez and Suthers, 2014). Relying on indicators, whether through the manual screening of adverts, or by the application of computational approaches, to identify exploitation is prone to generating false-positives, or in other words, conflating independent sex workers with victims of exploitation or trafficking (Kjellgren, 2022). This is primarily because the indicators used are unable to effectively distinguish between organised sexual labour vis-à-vis organised exploitation (Kjellgren, 2023). Besides, escort adverts, as a source of data, do not contain the information required to infer the presence of exploitation; contextual information is required for this and what we can infer from an individual’s situation based on online data is severely limited. Moreover, such approaches also fail to recognise that exploitation falls along a wide continuum and categorising adverts as being ‘suspicious’ or ‘indicative’ of sex trafficking or exploitation risks obscuring the many nuances within the sex market (Kjellgren, 2022). Research focussed on generating OSINT relating to the sex market has so far neglected the importance of accounting for this continuum of exploitation (Kjellgren, 2022; Malloch and Rigby, 2016; O’Connell Davidson, 2015), even though it is crucial for understanding complex processes of victimisation.
The empirical realities of human trafficking are often quite distinct from popular constructions of the issue (Albanese et al., 2022) and experiences of exploitation do not necessarily map well onto the legal definitions of human trafficking (O’Connell Davidson, 2013). OSINT will continue to be important in the investigation of exploitation and human trafficking, and there is therefore a need to move beyond simple conceptualisations of human trafficking, to more theoretically informed approaches, which locate exploitation within a continuum of experiences (Kjellgren, 2022). Furthermore, because exploitation fundamentally is a relational phenomenon, it is imperative that any approach aimed at disentangling the complexities of the online sex market also focusses on networks, as opposed to individual online escort adverts.
One of the critical shortcomings of previous research is that it has been preoccupied with the content of escort adverts and the assumption that information encapsulated therein is useful to identify exploitation. Based on recent research with human trafficking investigators, it is clear that it is not necessarily the content of adverts that is relevant to investigators (Kjellgren, 2023). Rather, it is the scale, geography and structure of networks, as opposed to adverts, that can be useful for either augmenting ongoing investigations, or deriving new intelligence, which can be triangulated with pre-existing local intelligence (Kjellgren, 2023). The key, from an investigative point of view, is being able to quickly gauge the structural composition, scale of a network and geographical patterning. Since previous research has been focussed on escort adverts, as opposed to networks of adverts, variables with potential utility for both policing and better understanding the sex market have effectively been ignored.
There is both a need for advancing theory in this area, by considering how a continuum of exploitation may be represented in online data, and for developing more effective and scalable approaches to policing Internet-mediated exploitation. It is vital to consider what variables can conceivably be derived from scraped escort adverts and precisely how these may improve our understanding of networks within the sex market. Evidence of organisation is important for trafficking investigations and, arguably, what OSINT from ASWs should be focussed on (Kjellgren, 2023). However, since far from all organised forms of sexual labour can be considered trafficking or exploitation, a certain degree of caution and sensitivity is required. The notion of complexity is useful in this context: measuring and operationalising different aspects of complexity, including structural composition, network scale and geographic patterning, may improve how online networks in the sex market are evaluated. Consequently, with a greater appreciation of how networks differ in complexity, law enforcement practitioners may be in a better position to distinguish between different forms of actors in the sex market and thereby reduce the risk that independent sex workers are subjected to monitoring and surveillance.

Methods

Data source

This research used a data set consisting of 213,693 unique online escort adverts, nested in 15,016 online networks (what is meant by ‘networks’ will be described in more detail below). The data set was created as part of a larger mixed-methods project (see Kjellgren, 2023) and a web scraper (written in Python 3.8.5, using the scrapy package1) was used to collect adverts on a fortnightly basis, between January 2021 and April 2022. The adverts were collected from one of the UK’s most popular ASW (henceforth referred to as ASW 1). The rationale for selecting this particular website was informed by previous research in the United Kingdom (including Kjellgren, 2019). In addition, some ASWs require more rigorous verification procedures for posting adult services adverts, such as providing passport details (Sanders et al., 2018), and it is conceivable that criminal networks and OCGs would favour ASWs where they can provide as little information as possible, and ASW 1 is one such site. The poster is required to pay a fee to post an advert, which remains online for a limited time. The site in question includes several subcategories of adult services. Since this research is focussed on the exploitation of adult women, the category relating to escort services provided by women was used as the basis for data collection. It should be noted that there are subcategories of different genders on ASW 1 and it is conceivable that transwomen may also choose to post in this category; this analysis simply focussed on adverts posted under the category of ‘female’.

Data structure and the operationalisation of online networks

Since the data can be considered ‘found data’ (Connelly et al., 2016) – data occurring as a result of social processes and not created for the purpose of research – some further explanations are necessary. Escort adverts are posted by independent sex workers and criminal networks alike to connect with sex buyers (Crocker et al., 2017) and doing so generates a digital record of the online marketing strategies used (e.g. the textual descriptions, characteristics and services offered). This is a crucial point: an advert represents particular elements of the marketing strategies used by individuals or networks, and they do not necessarily represent a unique individual, nor are the details provided necessarily truthful (Holt et al., 2021; Kjellgren, 2022, 2023). As an example, the advertised nationality can be considered part of a repertoire of marketing strategies, since it can be advantageous to conceal or advertise as a different nationality to appeal to particular demands in the market, or to avoid stigma.
Both independent sex workers and criminal networks are likely to advertise sexual services in different places throughout time, since mobility is key to appealing to new markets and reaching a greater number of clients (Crocker et al., 2017; Scoular et al., 2019). As such, these activities generate digital traces that can be used to analyse how individuals and networks try to penetrate local sex markets throughout the country. In terms of geographical variables, location data are self-reported. To post an advert on the ASW, the poster is required to specify their postcode. The full postcode is not accessible to the public, instead, the geographical information contained in adverts consists of the postcode district and coordinates. For this research, the coordinates were linked to the geographically closest full postcodes and the local authority district (LAD) to which they were related.
In the context of policing organised crime in the sex market, it is crucial to understand where vulnerability and risks of exploitation may be most prominent, including the structure of networks operating therein. The escort adverts collected for this research were used to identify online networks within the sex market. An algorithm was created to form ties between adverts based on shared phone numbers and ASW 1 user accounts (see Kjellgren, 2023, for more details). An online network in this context, therefore, relates specifically to how adverts are connected by these two attributes. Different approaches were also tested in the early stages of this research, including measuring the textual similarity of adverts to form ties. However, based on input from human trafficking investigators as part of other research (Kjellgren, 2023), phone numbers and user accounts were deemed the most reliable and valid measures to establish links between adverts.
There is obviously limited information on the offline structure of these networks, since only their digital footprints can be observed. As such, the term ‘network’ is used to refer to how adverts are structured, rather than information pertaining to the social relations between known individuals. Therefore, the footprints of an independent sex worker can be perceived as a ‘network’, because of how the posted adverts are linked together. The use of the term ‘network’ to refer to a single node can appear inappropriate and contradictory to the normal terminology used in social network analysis. In this research context, it must be recognised that a single advert (i.e. an isolated node), may be representative of multiple individuals part of the offline network. In other words, we can never, with escort adverts as a source of data, accurately assess how many individuals are actually involved in the network. As such, using the term ‘network’ when referring to isolated nodes may seem counterintuitive, but the terminology reflects the uncertainty associated with escort adverts, and allows for the language used throughout the article to be more consistent and concise.
These networks emerge organically throughout time, as independent sex workers and other actors in the sex market advertise their services and move around throughout the country. This also means that a snapshot in time may give an inaccurate picture of the underlying complexity of a network; the first advert posted may not be particularly informative, since it will be difficult to elucidate relational patterns. Throughout time, what might have first appeared to be an isolated advert can potentially grow into a network, simply because the individual(s) in the offline network posts more adverts from the same phone numbers or user accounts.
The structure of the data used can best be described as longitudinal, relational and hierarchical. It is longitudinal because data were collected over an extended period and it is also operationalised as a relational matrix: the escort adverts are linked together, in this case by phone numbers and user accounts, to form empirical networks. Finally, it can also be considered hierarchical, since escort adverts (n = 213,693) are nested in these empirical networks (n = 15,016). The PCA was thus conducted on the network level, with variables also measured at this level (e.g. network characteristics, as opposed to advert level information).

Exploring network complexity: PCA

To inductively explore the data in terms of network complexity, PCA was deemed an appropriate method for dimensionality reduction. The purpose of this was to reduce several variables that each measured different dimensions of complexity (scale, structure and geographical dispersion) into one variable, which could then be used as a basis for exploring the diversity of networks in the sample. PCA is useful when there is a set of variables measuring different aspects of the same phenomenon, which, when combined, ideally preserves the maximum amount of variability. Through linear combinations of the input variables, a smaller set of uncorrelated principal components (PCs) is derived, aimed at maximising the amount of explained variances in the latent dimensions, or PCs (Härdle and Simar, 2019). The results from the PCA suggested the first component to explain a reasonable amount of variance, while also representing a plausible way of ranking networks in terms of the relative complexity of their digital footprint. There were several iterations of the PCA, including different network-level variables operationalised in different ways. The final iteration, which included the variables shown in Table 1, was selected because of its theoretical and empirical value. This combined measurement – or PC – which will be referred to as the ‘network complexity score’ (NCS) throughout the article, is the result of the linear combinations of the values of the input variables specific to the data set used (Härdle and Simar, 2019). In other words, if the same method was applied, using the same input variables, but on a different and unrelated data set, the derived values would be different. In terms of transferability, whereas the different dimensions of complexity (network scale, geography and structure) are theoretically valid and transferable, the empirical contributions of each of the input variables for measuring complexity will be variable. As such, this work should be perceived as exploratory and an initial step to constructing a more robust, reliable and replicable scale in future research.
Table 1. Variables in the network complexity scale.
DimensionVariableOperationalisationDescription
1. Network scaleNumber of user accountsCount of the number of distinct user accounts used by the network.Gives a reasonable indication of the scale of a network. It is unlikely to directly translate to the number of people involved; however, a larger number of user accounts could likely indicate that more people are required to maintain the operation of the network
1. Network scaleNumber of advertised ethnicities.Count of the number of distinct ethnicities (of 12 possible) advertised by the networkWhereas, ethnicity certainly can be considered a marketing strategy, a larger number of advertised ethnicities could indicate that more than one person is involved
2. GeographyGeographical dispersionThe standard deviation of the distance between each pair of adverts within a network, as measured in kilometres. A standard deviation of 0 would indicate no dispersion and a larger standard deviation indicates larger geographical dispersionThis is important to both understand the geographical scale of a network and the potential movement patterns involved
3. Network structureDensityThe prevalence of dyadic relationships, in which a value of 1 indicates that all theoretically possible combinations of ties are formed and lower values indicate unformed combinations of tiesDensity provides a measure of how densely connected a network is. A high density means that adverts are largely connected by the same phone numbers and user accounts, which likely would be the case for independent sex workers. The opposite may be true for more distinct forms of offline organisations
3. Network structureAverage shortest pathThe average of the shortest paths between all pairwise combinations of adverts within a network. A lower number would indicate a cohesive network, whereas a higher number means that it is more dispersedAn important measure of network topology that captures how dispersed a network is, in terms of its connections
3. Network structureDiameterThe longest of all shortest paths between all pairwise combinations of adverts within a network. A higher number indicates both a dispersed and potentially larger networkAnother important topological measure that also feeds into the network size
Open in viewer

Variables

The combination of the variables listed in Table 1 taps into three different dimensions of network complexity: (1) network scale, (2) geographical dispersion and (3) network structure. The number of advertised ethnicities is arguably the only variable that largely represents marketing strategies, and it is also plausible that it will more frequently be used (as a marketing strategy or a reflection of reality) in larger networks. The relational measures used are, to some extent, a function of the number of phone numbers and user accounts, but more importantly, also the consistency in how they are used. In other words, networks will become more dispersed if a network frequently changes its phone numbers and user accounts.

Network case studies

A more qualitative and descriptive approach was deemed necessary to better illustrate how networks of varying complexity differ. Following the PCA, four networks with varying complexity scores were purposively selected for a more in-depth analysis. Because network size, to some extent, is influenced by the length of time a network is operating, it was important to choose networks that had been around for long enough to establish a presence. All networks selected had been present on the website for a year or longer. The comparison was focussed on the geographical distribution of adverts and the textual description of the adverts. This comparison also involved an ad hoc examination of digital traces associated with the networks outwith ASW 1. Some of the more prominent phone numbers were used to search for online sex buyer reviews, and other characteristics – such as the name of escort agencies – were also searched for to better understand the context of the posted adverts.

Ethical considerations

Researching the sex market is complicated and doing so through the analysis of online data poses unique challenges. First, unlike a survey, in which informed consent by research participants would be gained in advance, this was not possible with regard to the online escort adverts. However, whereas a survey would collect data on individuals, the escort adverts in this research did not pertain to individuals per se, but rather, the professional lives of sex workers and other actors in the sex market. In other words, the data collected represented digital traces associated with marketing strategies, which were posted online on publicly accessible platforms, and there is, as such, not necessarily a perception or expectation of privacy with regard to the data posted. While the data collected are not personal data per se, they were nevertheless treated as such, to protect the integrity and anonymity of the individuals posting the adverts. For the geographical figures and qualitative descriptions on networks with few cases, the details regarding locations have been modified, to protect the anonymity of the poster(s). Even so, this does not mean that the analysis of online escort adverts is unproblematic.
Indeed, the analysis of online escort adverts is inevitably a form of surveillance. UK police forces are already either monitoring or using adverts as part of their investigations into exploitation, and as established in previous sections, research using escort adverts has been conducted for several years. The problem, however, both with regards to policing and research, is that the uncritical collection and analysis of online adverts pose serious risks of causing harm to both individual sex workers and, more widely, sex worker communities. Individual sex workers risk having their adverts targeted by law enforcement as being indicative of potential trafficking. The consequences can be varied, from unwanted attention to detention and deportation. As for the wider sex worker communities, and as argued elsewhere (e.g. Kjellgren, 2022), poor research involving online data on the sex market, often motivated by particular anti-sex work ideologies, risks producing seriously misleading estimates or drawing simplistic conclusions and committing a particular form of digital fallacy: conflating online information and behaviours as reflecting an independent offline reality. Caution should always be used when analysing online escort adverts and this research attempts to both improve upon how the police utilise OSINT from adverts and, furthermore, to critically examine the value of escort adverts for social inquiries into the sex market. This research, as such, has the potential to provide a more nuanced understanding of the sex market, in which any evidence of organisation is not misinterpreted as evidence of trafficking or exploitation.
This research received ethical approval from the University of Stirling’s General University Ethics Panel on 8 October 2020 (reference number: GUEP/20/21/1007).

Findings

Exploring network complexity through PCA

The results from the PCA are shown in Table 2 and they indicate that PC1 explains a reasonable proportion of the variance in the underlying dimension (0.63), which in this instance, may relate to an unmeasured (latent) dimension of complexity. The other components (e.g. PC2–6) represent other (uncorrelated) dimensions of these networks, and while not within the scope of the current analysis, may nevertheless be interesting. PC2, for instance, is strongly and positively correlated with geographical dispersion, and modestly and negatively correlated with the number of advertised ethnicities and user accounts; this could perhaps indicate highly mobile independent sex workers. With regards to PC1, however, it can be observed that all the variables are positively correlated with PC1, except for density; this is a result of how density is measured. A density of 1.0 indicates that all theoretically possible combinations of ties are formed in the network and a lower density thus indicates the absence of possible ties, which renders its network structure more fragmented.
Table 2. PCA: network complexity.
 PC1PC2PC3PC4PC5PC6
Geographical dispersion0.250.920.05−0.280.150.01
Density−0.450.16−0.09−0.27−0.770.30
Diameter0.50−0.050.080.27−0.090.81
Average shortest path0.460.12−0.080.45−0.58−0.48
Number of advertised ethnicities0.37−0.20−0.78−0.460.000.00
Number of user accounts0.38−0.280.60−0.59−0.20−0.13
Standard deviation1.940.930.760.670.550.21
Proportion of variance0.630.150.100.080.050.01
Cumulative proportion0.630.770.870.940.991.00
n = 15,016
Open in viewer
Without explicitly measuring the network size (as could be defined by the number of adverts), the benefit of using PCA to explore complexity is that it can capture important dimensions that are not solely related to size. It is, nevertheless, quite strongly correlated with the total number of adverts posted by networks (r(15,014) = .61, p < .001) and the average number of adverts posted each month by networks (r(15,014) = .51, p < .001). PC1 can be perceived as representing one dimension of complexity, which is sensitive to the fact that smaller networks can be quite complex, yet it also captures the importance of network size in understanding processes in the online sex market. It is conceivable, though not possible to demonstrate with this data alone, that the more complex an online network appears, the more complex the organisation of the offline network may be. For instance, the use of multiple user accounts and a geographically dispersed pattern may suggest that a higher level of coordination and logistical capabilities are involved.
After extracting the first component, it was rescaled to range between 1 and 10 to aid interpretability. This measurement can be perceived as representing a ‘NCS’, in which each of the networks in the sample is ranked from the least to most complex, as defined by the combination of input variables. The distribution of NCSs is shown in Figure 1. The distribution is heavily skewed (skewness = 3.8) and can be considered leptokurtic (kurtosis = 35.6). The average NCS is 1.3, with a standard deviation of 0.4. The skewness aligns with what we would expect: most networks tend to be fairly simple, likely indicating independent sex workers or smaller collectives of sex workers, with comparatively few highly complex networks. The simplest form of network is simply an unconnected, single advert. The most complex network observed was assigned a score of 10; this is a truly large and complex network responsible for posting over 3000 adverts in the study period. As can be seen, it is considerably more complex than the other observed networks and this is one of the case study networks (network 124) presented below.
Figure 1. Distribution of network complexity scores.
Open in viewer
Figure 2 shows the distribution of networks by LADs. The table to the left shows the 20 LADs with the highest rate of networks per 100,000 population. First, it can be observed that all but one of the LADs (Isle of Scilly) had one or more networks in the research period. The average number of networks per local authority is 105.8 (SD = 117.3) and the median is 59.5. This, of course, does not tell us if these networks in fact were present in these local authorities; it does, however, suggest that the observed networks are marketing their services across the whole of the United Kingdom. Birmingham has the highest number of networks (1126) and some of the London boroughs also have a substantial number of networks. The cities of Manchester, Leeds, Glasgow, Leicester, Liverpool and Edinburgh also appear to be targeted by a significant number of networks.
Figure 2. Distribution of networks by local authority districts.
The number of unique networks (n = 15,016) by local authority districts (2022) and mid-year population estimates (2021).
Open in viewer
In Figure 3, the 50 most complex networks are shown by LADs. These 50 networks can be observed in all but 45 LADs (shaded in grey). The average number of complex networks in LADs is 4.4 (SD = 3.8), with a median of 3. Some of the most popular locations of these 50 networks include Manchester, Reading, Bristol, Glasgow and Slough. In addition to these, some slightly smaller cities and locations also appear to be quite prominent, including Oxford, Derby, Dundee and Maidstone. Even though the graph only shows 50 networks, the geographical footprints are quite widespread and it is clear that some quite remote and sparsely populated areas (such as the Scottish Highlands) may also be lucrative to complex networks.
Figure 3. Distribution of the top 50 most complex networks by local authority districts.
The number of unique networks (n = 50) by local authority districts (2022) and mid-year population estimates (2021).
Open in viewer
Table 3 shows a comparison of the 50 most complex networks to the other observed networks in the sample (n = 14,966). The most complex networks can be characterised by, on average, operating a substantial number of user accounts and phone numbers, posting a higher number of adverts each month, advertising multiple nationalities and ethnicities, and having a greater variability in the advertised age (perhaps indicating the presence of several individuals). In terms of geography, complex networks are, on average, more likely to have posted across a greater number of LADs and a higher average geographical distance between the posted averts. In terms of their structural composition, the most complex networks are, on average, signified by a lower network density and being more dispersed.
Table 3. Comparison of top 50 most complex networks and other networks.
VariableGroupMSDMedianMin.Max.
Network complexity scaleOther networks1.250.351.2213.38
Top 50 complex networks4.181.063.913.3910
Days active on ASWOther networks334.21310.5629713270
Top 50 complex networks599.9298.85521901842
User accountsOther networks1.511.621137
Top 50 complex networks31.9424.5625.59164
Phone numbersOther networks2.223.6411135
Top 50 complex networks59.466.3137.510399
AdvertsOther networks12.7839.97311846
Top 50 complex networks447.94499.22309.5303180
Adverts posted/monthOther networks1.491.361129.77
Top 50 complex networks7.987.385.841.646.76
Advertised nationalitiesOther networks1.591.681157
Top 50 complex networks11.49.489149
Advertised ethnicitiesOther networks1.330.74119
Top 50 complex networks4.681.714210
Advertised age (mean age of adverts within a network)Other networks27.456.98251860
Top 50 complex networks23.772.4523.2519.235.5
Advertised age (SD of adverts within a network)Other networks0.591.150027.6
Top 50 complex networks2.271.491.90.69.1
LADs posted inOther networks2.644.3811103
Top 50 complex networks32.6832.0721.51153
Average distance between pairwise combinations of adverts in a network (in kilometres)Other networks32.3672.710.310770.83
Top 50 complex networks131.1190.12107.40.8322.2
Network densityOther networks0.950.1510.091
Top 50 complex networks0.140.070.130.020.37
Network average shortest pathOther networks0.60.62104.09
Top 50 complex networks3.921.33.661.919.18
Network DiameterOther networks0.81.131011
Top 50 complex networks10.43.6810524
Comparison of the 50 most complex networks to the remaining networks in the sample (n = 14,966), across 15 variables related to network structure, scale, geography and marketing strategies.
Open in viewer

The complexity continuum: Network case studies

Network 43

This network consists of 22 adverts and it appears to have operated on this website for close to 7 years (note that the collected data only cover a limited period). In this case, the adverts appear to be posted by an independent Scottish sex worker based in the central belt of Scotland. ‘Scottishness’ is highlighted and potentially serves as an important marketing strategy. The consistency of the characteristics listed in the adverts, and the language used, reinforces the notion that this is likely to be an independent, British sex worker. This advert network also demonstrates how it is not uncommon for independent sex workers to display a certain degree of mobility – in other words, touring (Scoular et al., 2019). In this case, while the main location is in the central belt of Scotland, adverts are routinely posted along a circuit around some of the larger markets of the North East and Highlands.

Network 202

An interesting feature of this network, which is made up of 410 adverts, is that it appears to consist of two different escort agencies, with the primary one being in London and the second one in Kent. Being an agency, it appears to include a quite high number of sex workers of varying ages and primarily of White non-British ethnicity. The primary agency has a clear online presence outwith the ASW and their dedicated website has been active since 2015. The secondary agency only operated for a limited time, in which the activities of the primary agency ceased. Overall, all the adverts are posted in either Kent or locations in or close to London.

Network 2618

This network is much more complex than the previous two networks in many aspects. It specifically lists Brazilian or Colombian as the nationality in all 1273 adverts. It also highlights that the escorts are ‘independent’ and similarly describes the novelty of encounters with ‘sexy latinas’ as one of their marketing strategies. Online sex buyer reviews also suggest the nationalities listed as truthful marketing strategies. What really sets the network apart, however, is its widespread geographical dispersion. Indeed, it appears that regular movement is a feature of this network, with a strong presence in some of the largest markets in the country – London, Birmingham, Manchester and Glasgow. It might possibly reflect a larger offline organisation of people, given that over 250 phone numbers and 54 user accounts have been used in the research period alone.

Network 124

This is the most complex network that was operating throughout the research period. Numerous massage parlours or spas, with an online presence outwith the advertising website, are involved in this network. The network has been operating for a significant period and has accumulated a lot of reviews from sex buyers, most of whom describe the women as Asian. It also includes a multitude of advertised nationalities, including Thai, Romanian, Turkish, Indian and Taiwanese, though not British. Leicester appears to be a core location for the network, but it is consistently also maintaining a presence in all regions of the United Kingdom. A total of 399 phone numbers and 164 user accounts have been used to post 3180 adverts during the period of data collection.
The distribution of adverts in UK LADs for the four networks is shown in Figure 4. Here, the number of adverts in a LAD is shown by colour and the grey shading indicates that no adverts were posted in those LADs. A network can operate across a multitude of locations within a single local authority, or one of the larger metropolitan areas; this does not mean that it is necessarily vastly geographically dispersed. This is exemplified by network 202. Conversely, a network may post in relatively few locations, such as network 43 that stretches substantial distances. What truly makes a network complex, however, is the combination of posting in many unique locations spanning large geographical distances, such as networks 2618 and 124.
Figure 4. Geographical distribution of adverts for the four case study networks.
Open in viewer

Discussion and concluding remarks

This exploratory research was aimed at evaluating the feasibility of using scraped data to understand online networks, and in doing so, providing a more detailed description of the variety of networks operating in the UK’s off-street sex market. It is conceivable, though challenging to establish empirically, that the underlying processes contributing to the online advertisement of sexual services are likely to be different for different actors and networks and that the subsequent online footprints would likely also be variable.
In terms of the research questions, this research has demonstrated that it may be feasible to continue to explore how network complexity may best be measured and operationalised from scraped data; both for understanding the overall structure and organisation of the sex market but, more specifically, because it may be useful for law enforcement practitioners to better assess networks in the sex market. The results from the PCA indicate that the combination of variables linked to the network scale, geographical dispersion and structural composition can be useful for understanding the variability of networks in the sex market. The principal merit of developing a measure that is not solely based on network scale (such as volume of adverts), is that it may be more sensitive to nuances within the sex market, and thereby capture networks that are of operational interest to law enforcement. However, future research should focus on examining the feasibility of developing more robust, replicable and reliable measures, which has been a limitation of this exploratory research.
With regards to the second question – the extent to which networks differ in their complexity, the analysis suggests two things. First, the distribution of NCSs suggests the presence of a heavily skewed continuum of complexity: the norm appears to be simpler types of networks with a limited number of adverts. Large-scale networks with more widespread geographical footprints are considerably more uncommon. This is well in line with what we might expect, as independent sex workers and smaller sex worker collectives are likely to make up the majority of the sex market (Mai, 2009; Sanders et al., 2018; Scoular et al., 2019). Of course, some of the simpler networks – particularly the isolated adverts – may in fact be complex networks ‘in the making’; time will eventually tell if such observations are more akin to network 43, which appeared to indicate an independent sex worker, or if the network will continue to grow in complexity as more adverts are posted.
The comparison of the four different networks illustrated how the footprints of networks of varying levels of complexity are manifested online. While it is impossible to know precisely what goes on offline, the networks presented illustrated different types of structures that we might expect to find in the sex market, including independent sex workers, online escort agencies, migrant networks and large-scale networks with highly complex digital footprints.
Network 2618 was largely structured around ethnic lines. We would of course expect potential migrant networks to exist within the sex market – not only because a significant part of the sex market is made up of migrants (Mai, 2009) – but also because a lack of social and location-specific human capital might make cooperation and the organisation into collectives more likely (Kjellgren, 2023). However, the scale and sheer complexity of 2618 and 124 appear somewhat unique. The scale of network 124 possibly implies the underlying network is more organised than what we would expect from a collective of migrant sex workers. This should not be confused with the suggestion that these involve trafficking or exploitation; it needs to be recognised, however, that for such large networks to operate under a prolonged period, there are potential criminal implications for those running such a network (e.g. brothel-keeping or money laundering charges). With more risks involved, the working conditions could potentially be more exploitative for sex workers, as the facilitators must make the profits worth the risks they are taking. Perhaps, in such circumstances, we could also hypothesise a form of large-scale commercial exploitation to be more likely.
It is of both theoretical and empirical importance to better understand how networks in the sex market vary in terms of structure, geography and complexity. This has perhaps been the first attempt to move beyond evaluating the presence of trafficking by the use of indicators, and rather, offered a novel approach to more parsimoniously rank networks based on their relative complexity, where the unit of analysis is networks, as opposed to escort adverts. This brings us closer to a more nuanced understanding of the sex market, which recognises the following four premises:
1.
There is a continuum of organisation within the sex market and networks are anticipated to vary in complexity along this continuum.
2.
There is a continuum of complexity with regard to the structural characteristics of networks operating within the sex market.
3.
There is a continuum of vulnerability and exploitation within the sex market.
4.
The online footprints of networks are likely to approximate the offline organisation in terms of their scale, structure and complexity, but their marketing strategies are not necessarily truthful.
The utility of this approach is that it recognises that it is unrealistic to identify and pinpoint trafficking and exploitation from intelligence gleaned from online adverts, but that the structure of the observed online networks potentially can tell us something about their offline organisation. Pitcher (2015) has previously highlighted how the sex market involves a variety of employment or labour forms, including independent sex workers, collectives, brothels and agencies; a continuous operationalisation of network complexity allows us to better recognise these nuances. It is also sensitive to the proposition made by Scoular et al. (2019), namely, that sexual labour facilitated by a third party is not necessarily indicative of control or exploitation.
Computational methods and web scraping are potentially important to police organised crime in the sex market, and it must also be recognised that the implementation of algorithms (even if they are not applied predictively) to collect and analyse open-source data is not unproblematic (Kjellgren, 2023). Indeed, the process of collecting and analysing data on everyone, as opposed to individuals under suspicion, has previously been referred to as dragnet surveillance (Brayne, 2020). In this research context, all escort adverts are collected and scrutinised, including those posted by independent sex workers. The application of dubious and unsubstantiated indicators of human trafficking, as argued by De Vries and Cockbain (2024: 1), ‘can give an undue illusion of objectivity and reliability when they are neither neutral nor unskewed’. In other words, moving beyond indicators and situating trafficking and exploitation along a continuum, is arguably a more suitable approach for policing the sex market and focussing on advert networks rather than adverts makes it easier to avoid the over-policing of independent sex workers, or smaller sex worker collectives. However, because escort adverts are a highly epistemologically complex source of data, underpinned by a number of uncertainties, the application of OSINT will be most successful when used reactively and in triangulation with more robust offline intelligence.
To conclude, this research used web scraping, social network analysis and PCA to evaluate the feasibility of using scraped data to understand networks in the off-street sex market. It has been argued that examining the complexity of networks, in terms of their scale, structure and geography, may be more feasible for generating OSINT on criminal networks in the sex market, as opposed to assessing the information contained within adverts. The networks examined in this analysis were demonstrated to span a continuum of complexity, ranging from what could conceivably be identified as an independent sex worker, to large-scale networks with a geographical presence in large parts of the United Kingdom. A limitation of working exclusively with open-source data, however, is naturally that it is impossible to understand the offline contexts in which these networks operate and the number of individuals possibly involved. Future research should seek to combine (past) investigative police data with open-source data to examine the correlations between online and offline structures of criminal networks and OCGs and continue to evaluate the potential of different quantitative measures that can capture the variability and diversity of online networks. The methods for data collection, operationalising networks and exploring complexity can serve as an important starting point for more thorough examinations of how to best understand and police the online dimension of exploitation and trafficking.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: This research was funded by the Economic and Social Research Council (ESRC), through the Scottish Graduate School of Social Sciences, as part of a 1 + 3 MSc and PhD studentship.

ORCID iD

Footnote

References

Albanese J, Broad R, Gadd D (2022) Consent, coercion, and fraud in human trafficking relationships. Journal of Human Trafficking 8(1): 13–32.
All-Party Parliamentary Group on Prostitution and the Global Sex Trade (2018) Behind closed doors: Organised sexual exploitation in England and Wales. Report, All-Party Parliamentary Group on Prostitution and the Global Sex Trade, London, May.
Atkinson C, Hamilton-Smith N (2020) Still an ‘invisible crime’? Exploring developments in the awareness and control of human trafficking in Scotland. European Journal of Criminology 19(2): 147737082093186.
Brayne S (2020) Predict and Surveil: Data, Discretion, and the Future of Policing. New York: Oxford University Press.
Cockbain E, Brayley-Morris H (2018) Human trafficking and labour exploitation in the casual construction industry: An analysis of three major investigations in the UK involving Irish traveller offending groups. Policing 12(2): 129–149.
Cockbain E, Bowers K, Hutt O (2022) Examining the geographies of human trafficking: Methodological challenges in mapping trafficking’s complexities and connectivities. Applied Geography 139: 102643.
Connelly R, Playford CJ, Gayle V, et al. (2016) The role of administrative data in the big data revolution in social science research. Social Science Research 59: 1–12.
De Vries I, Cockbain E (2024) Governing through indicators: Structural biases and empirical challenges in indicator-based approaches to human trafficking research and policy. In: Faulkner E (ed.) Modern Slavery in the Global Context. Bristol: Bristol University Press.
Dubrawski A, Miller K, Barnes M, et al. (2015) Leveraging publicly available data to discern patterns of human-trafficking activity. Journal of Human Trafficking 1(1): 65–85.
Europol (2020) The challenges of countering human trafficking in the digital era. Report, Europol Operations Directorate, The Hague, October.
Giommoni L, Ikwu R (2021) Identifying human trafficking indicators in the UK online sex market. Trends in Organized Crime. Epub ahead of print 17 September.
Härdle WK, Simar L (2019) Applied Multivariate Statistical Analysis. Cham: Springer.
Hester M, Mulvihill N, Matolcsi A, et al. (2019) The nature and prevalence of prostitution and sex work in England and Wales today. Report, University of Bristol, Bristol, October.
Holt V, Kenway E, Berry A (2021) Sex workers as collateral damage, once again: A critique of the new ‘Sex Trafficking Identification Matrix’ Tool. Sex Worker Advocacy and Resistance Movement. Available at: https://www.swarmcollective.org/blog/2021/9/28/sex-workers-as-collateral-damage-once-again-acritique-of-the-new-sex-trafficking-identification-matrix-tool (accessed 21 May 2023).
Home Office (2023) Modern slavery: National referral mechanism and duty to notify statistics UK, end of year summary 2022. Available at: https://www.gov.uk/government/statistics/modern-slavery-national-referral-mechanism-and-duty-to-notify-statistics-uk-end-of-year-summary-2022
Ibanez M, Suthers DD (2014) Detection of domestic human trafficking indicators and movement trends using content available on open internet sources. In: 2014 47th Hawaii international conference on system sciences, Hilton Waikoloa, Hawai’i, USA, 6-9 January 2013, pp. 1556–1565. New York: ACM.
Kjellgren R (2019) Deciphering digital networks: Using open-source data to assess the role of ethnicity and nationality in structuring the off-street sex market. MSc Dissertation, University of Stirling, Stirling.
L’Hoiry X, Moretti A, Antonopoulos GA (2021) Identifying sex trafficking in adult services websites: An exploratory study with a British police force. Trends in Organized Crime. Epub ahead of print 5 May.
Mai N (2009) Migrant workers in the UK sex industry: Full research report. Report, Economic and Social Research Council, Swindon.
Malloch M, Rigby P (2016) Contexts and complexities. In: Malloch M, Rigby P (eds) Human Trafficking: The Complexities of Exploitation. Edinburgh: Edinburgh University Press, pp. 1–17.
National Police Chiefs’ Council (NPCC) (2019) National policing sex work and prostitution guidance. Report, the National Police Chiefs’ Council, London, January.
O’Connell Davidson J (2013) Troubling freedom: Migration, debt, and modern slavery. Migration Studies 1(2): 176–195.
Pajón L, Walsh D (2018) Proposing a theoretical framework for the criminal investigation of human trafficking crimes. Policing 14: 493–511.
Pitcher J (2015) Sex work and modes of self-employment in the informal economy: Diverse business practices and constraints to effective working. Social Policy and Society 14(1): 113–123.
Sanders T, Keighley R (2023) The role of adult service websites in addressing modern slavery. Report, Modern Slavery and Human Rights Policy & Evidence Centre, University of Leicester, Leicester, November.
Sanders T, Vajzovic D, Brooks-Gordon B, et al. (2020) Policing vulnerability in sex work: The harm reduction compass model. Policing and Society 31(9): 1100–1116.
Skidmore M, Garner S, Desroches C, et al. (2018) The threat of exploitation in the adult sex market: A pilot study of online sex worker advertisements. Policing 12(2): 210–218.
United Nations (2000) Protocol to prevent, suppress and punish trafficking in persons especially women and children. Available at: https://treaties.un.org/pages/ViewDetails.aspx?src=TREATY&mtdsg_no=XVIII-12-a&chapter=18
Verhoeven M, van Gestel B, de Jong D, et al. (2013) Relationships between suspects and victims of sex trafficking. Exploitation of prostitutes and domestic violence parallels in Dutch trafficking cases. European Journal on Criminal Policy and Research 21(1): 49–64.
Walby S, Apitzsch B, Armstrong J, et al. (2016) Study on the gender dimension of trafficking in human beings: Final report. Report, European Union, Migration and Home Affairs, Luxembourg, June.

Biographies

Richard Kjellgren is a Research Fellow at the University of Stirling. His current research is focussed on linked administrative health and justice data, and responses to exploitation.

Cite article

Cite article

Cite article

OR

Download to reference manager

If you have citation software installed, you can download article citation data to the citation manager of your choice

Share options

Share

Share this article

Share with email
EMAIL ARTICLE LINK
Share on social media

Share access to this article

Sharing links are not relevant where the article is open access and not available if you do not have a subscription.

For more information view the Sage Journals article sharing page.

Information, rights and permissions

Information

Published In

Article first published online: April 15, 2024

Keywords

  1. Criminal networks
  2. human trafficking
  3. online spaces
  4. open-source intelligence
  5. organised crime
  6. policing

Rights and permissions

© The Author(s) 2024.
Creative Commons License (CC BY 4.0)
This article is distributed under the terms of the Creative Commons Attribution 4.0 Lficense (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).

Authors

Notes

Richard Kjellgren, Salvation Army Centre for Addiction Services and Research, University of Stirling, Stirling FK9 4LA, UK. Email: r.r.kjellgren@stir.ac.uk

Metrics and citations

Metrics

Journals metrics

This article was published in Criminology & Criminal Justice.

VIEW ALL JOURNAL METRICS

Article usage*

Total views and downloads: 277

*Article usage tracking started in December 2016


Altmetric

See the impact this article is making through the number of times it’s been read, and the Altmetric Score.
Learn more about the Altmetric Scores



Articles citing this one

Receive email alerts when this article is cited

Web of Science: 0

Crossref: 0

There are no citing articles to show.

Figures and tables

Figures & Media

Tables

Table 1. Variables in the network complexity scale.
Table 2. PCA: network complexity.
Table 3. Comparison of top 50 most complex networks and other networks.

View Options

View options

PDF/ePub

View PDF/ePub

Get access

Access options

If you have access to journal content via a personal subscription, university, library, employer or society, select from the options below:


Alternatively, view purchase options below:

Purchase 24 hour online access to view and download content.

Access journal content via a DeepDyve subscription or find out more about this option.

View figure
Figure 1
Figure 1. Distribution of network complexity scores.
View figure
Figure 2
Figure 2. Distribution of networks by local authority districts.
The number of unique networks (n = 15,016) by local authority districts (2022) and mid-year population estimates (2021).
View figure
Figure 3
Figure 3. Distribution of the top 50 most complex networks by local authority districts.
The number of unique networks (n = 50) by local authority districts (2022) and mid-year population estimates (2021).
View figure
Figure 4
Figure 4. Geographical distribution of adverts for the four case study networks.
Table 1
Table 1. Variables in the network complexity scale.
Table 2
Table 2. PCA: network complexity.
Table 3
Table 3. Comparison of top 50 most complex networks and other networks.